Using training regimens to teach expanding function approximators

Authors

  • Peng Zang
  • Arya Irani
  • Peng Zhou
  • Charles Lee Isbell
  • Andrea Lockerd Thomaz
Abstract

In complex real-world environments, traditional (tabular) techniques for solving Reinforcement Learning (RL) problems do not scale. Function approximation is needed, but existing approaches generally have poor convergence and optimality guarantees. In human environments, it is also valuable to be able to leverage human input. In this paper we introduce Expanding Value Function Approximation (EVFA), a function approximation algorithm that returns the optimal value function given sufficient rounds. To leverage human input, we introduce a new human-agent interaction scheme, training regimens, which allows humans to interact with and improve agent learning in the setting of a machine learning game. In experiments, we show that EVFA compares favorably to standard value approximation approaches. We also show that training regimens enable humans to further improve EVFA performance. In our user study, we find that non-experts are able to provide effective regimens and that they found the game fun.

Similar resources

Comparison of the Effect of Two Teach-Back Training and Pictorial Training Methods on Medication Adherence in Heart Failure Patients

Background & Aim: Medication nonadherence prevents the achievement of therapeutic goals in cardiovascular patients. Training is essential to increase medication adherence. Therefore, the present study compared the effects of two training methods, teach-back and pictorial, on medication adherence in heart failure patients. Methods: This quasi-experimental study was performed on 210 heart failure...

A Model-Based Actor-Critic Algorithm in Continuous Time and Space

This paper presents a model-based actor-critic algorithm in continuous time and space. Two function approximators are used: one learns the policy (the actor) and the other learns the state-value function (the critic). The critic learns with the TD(λ) algorithm and the actor by gradient ascent on the Hamiltonian. A similar algorithm had been proposed by Doya, but this one is more general. This al...

Combination of Subtractive Clustering and Radial Basis Function in Speaker Identification

Speaker identification is the process of determining which registered speaker provides a given utterance. Speaker identification requires making a claim on the identity of a speaker from among the Ns trained speakers in the user database. In this study, we propose a combination of a clustering algorithm and a classification technique: subtractive clustering and Radial Basis Function (RBF). The proposed techniq...

The Nominations with the Inherent and Adherent Approximators in the German Language

The article deals with the description of the conceptual and linguistic nature, the modus character and the field structure of the linguistic category of the approximation. It also deals with the analysis of the role of the inherent and adherent approximators as well as the context in the realization of the invariant meaning of the approximation in the German language. Pragmatic potential of th...

High-accuracy value-function approximation with neural networks applied to the acrobot

Several reinforcement-learning techniques have already been applied to the Acrobot control problem, using linear function approximators to estimate the value function. In this paper, we present experimental results obtained by using a feedforward neural network instead. The learning algorithm used was model-based continuous TD(λ). It generated an efficient controller, producing a high-accuracy ...

Publication date: 2010